Molecular Spiders with Memory 
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Synthetic bio-molecular spiders with "legs" made of single-stranded segments of DNA can move 
on a surface which is also covered by single-stranded segments of DNA complementary to the leg 
DNA. In experimental realizations, when a leg detaches from a segment of the surface for the first 
time it alters that segment, and legs subsequently bound to these altered segments more weakly. 
Inspired by these experiments we investigate spiders moving along a one-dimensional substrate, 
whose legs leave newly visited sites at a slower rate than revisited sites. For a random walk (one-leg 
spider) the slowdown does not effect the long time behavior. For a bipedal spider, however, the 
slowdown generates an effective bias towards unvisited sites, and the spider behaves similarly to 
the excited walk. Surprisingly, the slowing down of the spider at new sites increases the diffusion 
coefficient and accelerates the growth of the number of visited sites. 



I. INTRODUCTION 

Chemists have recently constructed various synthetic 
molecular systems (see e.g. p| [H, [1] and a review 
which can move on surfaces and tracks. One class of 
such objects aptly named spiders [|[ consists of relatively 
small molecules with legs which are short single-stranded 
segments of DNA. These spiders can move on a surface 
covered with single-stranded DNA segments, called sub- 
strates. The substrate DNA is complementary to the leg 
DNA. The motion proceeds as legs bind to the surface 
DNA through the Watson-Crick mechanism, then disso- 
ciate, then rebind again, etc. More precisely, a bond on 
the substrate with an attached leg is first cleaved Q, 
and the leg then dissociates from the affected substrate 
(which is called product ||). After that the leg can re- 
bind to the new substrate or to the product. 

The rate of attachment of a leg of a spider to the sub- 
strate and the rate of detachment from the substrate are 
different from the corresponding rates involving the prod- 
uct. Therefore for the proper description of the motion 
of a single spider one must keep track of its entire trajec- 
tory. This memory requirement makes the problem non- 
Markovian @, 0|- Such problems tend to be intractable, 
though in one dimension non-Markovian problems are 
occasionally solvable, see e.g. @, H, [13, Hll.ll3|- 

In this paper we continue the investigation of molecu- 
lar spiders that move along one-dimensional tracks [l3l ]. 
The goal is to study the interplay between the spiders and 
their environment which has been ignored in Ref. [l3|. 
We shall assume that each leg of a spider steps indepen- 
dently and symmetrically to neighboring empty sites at 
rates 1 from revisited sites and at rates r from sites vis- 
ited for the first time. For molecular spiders r < 1 since 
legs are bound more firmly to new sites. Two types of 
spiders were introduced in [131 ] which differ in the type 
of constraint that keeps the legs close to each other. For 
local spiders or centipedes, the distance between adja- 
cent legs is < s; for global spiders, the distance between 
any two legs is < S. Without memory (r = I), spiders 
with local constraint can be mapped onto exclusion pro- 



cesses with open boundaries, while spiders with global 
constraint map to the simple exclusion process with pe- 
riodic boundary conditions jl3j |. These mappings sim- 
plify the computations of the diffusion coefficient and, 
e.g., they allow to determine the diffusion coefficient for 
general global spiders with an arbitrary number of legs L 
and an arbitrary maximal distance S, and for local L-\eg 
spiders with s = 2. 

The non-Markovian character does not make the prob- 
lem hopeless since the visited area is very simple, namely 
the visited sites (product) form an island in the sea of 
unvisited sites. (This happens only in one dimension, 
and the two-dimensional problem appears intractable.) 
Graphically, we display a leg of the spider by •, an empty 
sites as o, and we put a hat at sites which have never been 
visited earlier, i.e. where the DNA substrate is still un- 
cleaved. A typical configuration for a bipedal spider then 
looks like 

...6666ooooooo»o»6666... 
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The memory solely acts through the slowdown of legs 
on newly visited sites. For a one-leg spider, which 
is merely a random walk, memory does not affect the 
asymptotic behavior of the number of visited sites and 
other quantities. On the other hand, the slowdown gen- 
erates an effective bias of multi-pedal spiders. This bias 
always points outward. Remarkably, the slowdown effec- 
tively speeds the spider up, that is, it leads to the increase 
of the diffusion coefficient and the number of visited sites. 

To mimic the effective bias of the spiders, we investi- 
gate a random walk which jumps asymmetrically from 
newly visited sites. This model is similar to the so-called 
exited walk which has been actively investigated in recent 
years in the mathematics literature, see e.g. [1,0, and 
references therein. 

The outline of this paper is as follows. In Sec. UH we 
analyze a random walk with memory. First we analyze 
the symmetric random walk that slows down on the new 
sites, and then we obtain exact results for the exited walk 
which additionally takes into account a bias on the new 
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sites. In Sec. IIII1 we discuss the behavior of the bipedal 
spiders with maximal leg distance s = 2. Some of the 
properties of these spiders are exactly computed, while 
others are established via the analogy with the excited 
random walk. In Sec. IIV1 we generalize these results 
to global spiders with L — S. We draw conclusions in 
Sec. [V] Finally, in an appendix, some of the details re- 
quired for calculations of Sec. [TT] are presented. 



for the mean exit time, which simplifies to 
(T(x + l))-2(T(x)) + (T(x -!))■■ 



(3) 



Solving this equation subject to the boundary conditions 
T(0) = T(M) = we obtain 



(T(x)) 



x(M - x) 



(4) 



II. EXCITED RANDOM WALK 

In this section we consider the case of a one-leg spi- 
der. We shall begin with a nearest-neighbor symmetric 
random walk on the one-dimensional lattice. By defini- 
tion, if the walker occupies a site for the first time, the 
hopping rate to each of its nearest neighbors is r; if the 
walker has already visited this site in the past, it hops at 
rates 1 to the left and to the right. 



A. Average time to cover N sites 

A quantity which is most readily computable is the 
average time (TV) for the random walk to cover TV sites. 
The total number of distinct sites visited by the random 
walk grows with time, and let TV be the time when the 
number of distinct sites first reaches N+l (here N is the 
size of the island and the 1 accounts for the site with the 
random walker on it). Graphically, 
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The walk starts at some site and hence To = 0; all the 
following times < T\ < T% < . . . are random quantities. 
We shall actually compute the first two moments of TV. 
The first moment (the average) is 



N 



N(N-1) N 
4 + 27 



(1) 



To derive |T]) we first recall 
exit time from the interval (0, 
hopping at unit rates to the 
definition, the exit time T(x) 
walker to reach any of the two 
that the walker starts at site x. 
e.g. in [14]) is to analyze chan; 
interval dt. One writes 



T(x) = dt 



how to compute the mean 
M) for the random walker 
right and to the left. By 
is the time for the random 
boundaries, or M, given 
The procedure (explained 
:es in an infinitesimal time 
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and making the averaging one arrives at 

(T(x)} = dt+(l- 2dt)(T(x)) 

+ dt[(T(x + l)) + (T(x-l))] (2) 



Returning to our problem, let us compute the time in- 
terval tjv during which the number of visited sites jumps 
from JV + 1 to AT + 2. Without loss of generality we can 
assume that at TV the visited sites are 1, . . . , N + 1, and 
the random walk is at site N + l. Similarly to ([2]) we 
write an equation for the mean duration 



(tjv) = (1 - 2rdt)(r N ) +rdtN + dt 



(5) 



The factor N in the middle term on the right-hand side 
was derived by taking into account that if the random 
walker hops to the left to site N, the number of distinct 
sites will increase when the random walk reaches either 
site or N + 2. This latter problem is equivalent to 
the above exit problem after identifying M = N + 2 and 
x = N, and Q shows that the mean exit time is N. 
From Eq. {5} we obtain 



Using 
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JV-l 



N 
~2 



(6) 



(7) 
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in conjunction with ^ we arrive at (JTJ) . 

If the hopping rates were equal to unity everywhere 
(no difference between the substrate and the product), 
we would have 



uniform 



N(N + 1) 



(8) 



instead of ([T]). Comparing these two results we see that 
only the sub-leading corrections (linear in N) are differ- 
ent. Hence the non-Markovian nature has an asymptot- 
ically negligible influence on (T/v) for the random walk. 

Other calculations have led to the same conclusion. 
For instance, we have computed (Tjy) and we found 



(T 2 N ) 
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For r — 1 (no memory) equation @ becomes 



(T 2 N ) uniform 



12 



N 3 +N 2 
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N (9) 



(10) 
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i.e. it has the same leading behavior. 

In the r <C 1 limit, memory is very important in the 
earlier stage; the crossover time t c after which memory 
becomes irrelevant is estimated by equating the linear 
and quadratic (in N) terms in ([!]). This gives 



t. 



(11) 



Thus the memory does not affect the asymptotic behavior 
of (Tjv) for the nearest-neighbor random walk. 

This conclusion may be affected by the gait of the walk. 
Unfortunately, even for the random walk it is hard to 
probe complicated gates when we take memory into ac- 
count. For instance, if the random walk makes ±1,±2 
hops, say all at the same rates, the problem seems in- 
tractable. The above approach allows to compute (Tn) 
only when the random walk hops to nearest-neighbor 
sites or gets back to its original site. For such gait we 
found 



(F 



N) 



1 

37 



1 

12 



N 



(12) 



hence the influence of memory on (Tn) for this random 
walk with modified gait is again asymptotically irrele- 
vant. 



B. Growth of the average number of visited sites 

We now turn to a more general excited walk; we shall 
use these results for the bipedal spider in Sec. IIII CI An 
excited walk is defined as follows: When the walker leaves 
a particular site for the first time it jumps at rate / for- 
ward (to unvisited sites) and at rate b backward (to re- 
visit sites). When the walk jumps from a site it had 
already visited earlier, it jumps at rates one in both di- 
rections. For concreteness we define its first ever jump 
from the origin to the right at rate / and to the left at 
rate b. This special rule, however, is irrelevant in the long 
time limit which is our main focus. 

The above definition slightly differs from the definition 
of the excited random walk studied in Refs. @, d, OH 
where it is a discrete time process, and where the excited 
walker is biased always into one specific direction (say 
always to the right). 

On the already visited sites, the walker performs a sim- 
ple symmetric random walk. In the following analysis, 
we will need certain results for the simple random walk. 
Particularly, for the walker on the interval [0, k] we will 
need the exit probability density <?fe(t|m) that the walker 
which starts at site m inside the interval, < m < k, 
exits at any of the two end sites of the interval during 
the time interval (t,t + dt). An elementary derivation 
of this probability density (more precisely, of its Laplace 
transform) is given in Appendix [A"l 

One can try to repeat the calculation of (Tn) in this 
more general situation. Instead, we consider a dual quan- 
tity, namely we shall compute how the average number 



of visited sites increases with time. Suppose that ex- 
ited walk had already visited k — 2 sites and it has just 
jumped to a new site. Let the already visited sites be 
1, . . . , k — 2, and let the random walker be currently at 
x = k — 1. (The latter assumption is acceptable since 
the rules are symmetric. This is not true for the excited 
random walk of Refs. [1,0, E3].) The probability density 
that the walker will visit the next new site (either site 
or site k) exactly t times later is 

oo 

Q k {t) = / e -( fe +/)< + b ( e-( b +» f 'g k (t - f'|2) dt' (13) 



The first term in this expression accounts for a direct 
jump forward from site k — 1 to k, since the walk stays 
at its original position k — 1 for time t with probabil- 
ity exp[— (b + f)t\. The second term describes the walk 
stepping backwards to site k — 2 after time t', doing a 
symmetric walk on sites 1, . . . , k — 1, and exiting this in- 
terval at either ends at time t with probability density 
9k(t ~ t'\2). The Laplace transform of (fl3|) is 



f + bg k (s\2) 
s + b + f 



(14) 



[The Laplace transform of the general exit probability 
density g k (s\m) is given by (|A5|) .] 

The above argument works also for the first ever step of 
the walk with our definition. If we prefer the assumption 
of hopping at rate / in both directions for the first time, 
we have to use Q2{s) — 2//(s + 2/). This, however, does 
not affect the long time behavior. 

The probability density that the walk visits the n th 
new site at time t is 



F n (t) 



J\Qk{t k )dt k 



(15) 



fe=2 



where t% + t§ + • • • + t n = t. The Laplace transform of 
the convolution (fTS"]) is 



Fn(s) = Y[Qk(s) 



(16) 



fe=2 



One anticipates that the main contribution in the long 
time limit comes from terms with k ~ \ft. This suggests 
to take the s — > limit while keeping ^fsk finite. In this 
limit g k (s\m) is given by (|A7|) and hence (Tj"4"|) becomes 



Q k (s) = 1 — a^fs tanh 



y/sk 



where we have used the shorthand notation 

2 



1 + //6 



(17) 



(18) 



Note that a and consequently the whole distribution 
F n (s) only depends on the ratio f/b of the hopping rates. 
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The no bias case corresponds to a = 1, which coincides 
with any symmetric hopping / = g. This has an inter- 
esting consequence — if the walk is only slowed down at 
new sites (but not biased), then it has no effect on its 
long time behavior, in agreement with the conclusions of 
the previous subsection Iff Al Note also that < a < 2. 

Now we return to the calculation of F n (s). The prod- 
uct in (fTrJ)) can be re-written as 



F„ 



exp 



\k=2 



In the s — > limit we use (TTTl) to obtain 



F n (s) — exp I — a\fs tanh 



fc=2 



fn the same limit the sum becomes an integral 



it 

/ 



tanh dk = —= In cosh ■ 



and hence 



F n {s) = cosh 



(19) 



(20) 



(21) 



(22) 



At this point we recognize that this generating func- 
tion is identical (up to a change in notation) to the one 
obtained in Q for a discrete time excited walk on the half 
infinite line. This allows us to use the results of Ref. @. 

Note that F n (s = 0) = / °° F n (t)dt — I since the walk 
will cover any number of sites with probability one. The 
cumulants p} can be directly obtained from the generat- 
ing function F n (s) to yield 

K l = aC l n u (23) 

with 

C e = 8 1/3 (1 - 2- 2e )(£ - l)!7T- 2 *C(2f) (24) 

For example, C x = 1/4, C 2 = 1/48, C 3 = 1/240. The 
average time to cover n sites, and its fluctuation, are 



«1 



<*> = T 



K-2 



(t 2 ) (tf = ^ 



(25) 



Note that these exact expressions have been already de- 
rived and given by equations (jTJ) and ([9]) in the symmetric 
a = 1 case. 

The distribution F n (t) attains a scaling form 

F n (t)=t- 1 9(/ i ), n=-* (26) 

in the scaling limit n, t — > oo with the scaling variable fi 
kept finite. The scaled distribution ^(/i) has been com- 
puted in Ref. Q, which in the present notation reads 



OO 

E 

fc=0 



-2a 
k 



{k + a)e-^ ( - k + a ^/ i (27) 
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FIG. 1: The scaled distribution (I27p of the number of visited 
sites by the exited walk for several values of bias. 



The average number of visited sites at time t is 
J °° nF n {t) dn 



(n) 



f™F n (t)dn 



A(a)Vt 



with 



oo 



(28) 



(29) 



where we have integrated ([27]) term by term, and used 
the duplication formula for Gamma functions. Note that 
A[a) is a monotonically decreasing function for a > 0. 
(This can be proved e.g. by differentiating lnA(a) and 
using the properties of the di-gamma function.) 

In the no-bias case (o = 1), equation (f2"9"| reduces 
to the well known result [lj| for the amplitude A(l) = 
4/y / 7r. In the limits of strong forward and backward bi- 
ases, the amplitude behaves as 



A{a) 



8 



Oil) 



for a <C 1 



(30) 



0(2 - a) for 2 - a < f 



Note that the naive inversion of the average time 
would result in the incorrect A = 2/s/a. 



III. BIPEDAL SPIDER 

Here we compute various characteristics of the simplest 
bipedal spider with s = 2. We shall see that memory 
affects the leading behaviors, and we will explain these 
findings by showing that the slowdown of legs on new 
sites leads to effective outward bias when the spider is at 
the boundary of the island of visited sites. 
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A. Average time to cover N sites 

Since the spider configuration is • o • when a new site is 
first visited, it is convenient to start from the same type 
of configuration 

...6666»o»6666... 

This assumption simplifies the calculation, but does not 
alter the asymptotic behavior. 

Denote by tn the time interval during which the num- 
ber of visited sites jumps from N + 3 to N + 4; then the 
number of visited sites first reaches N + 3 at the moment 
Tjv = J2o<j<N-i T j- We can assume that at Tn the vis- 
ited sites are 1, . . . , TV + 3, the spider's legs are at sites 
N + 1 and N + 3. Thus the configuration at time TV is 



...oooo---o»o»ooo. 



N 



(31) 



Similarly to ([5]) we write 

(t n ) = [1- (l + r)dt](T N ) +rdt 
+ dt(r N ) +dt 



3(2N + 1) 



(32) 



for the mean duration. The second term on the right- 
hand side accounts for the right leg of the spider hopping 
from site N + 3 to N + 2. The site N + 3 turns from the 
substrate into the product and the configuration becomes 



...OOO 0"'OIIOOOO.., 

N 

We term the mean position of the legs as the " center of 
mass" of the spider. Note that the center of mass of the 
spider is at position N + 3/2 and it makes ±1/2 hops 
with unit rates. The visit of sites 1 or N + 3 by the 
center of mass corresponds to visiting sites or N + 4 by 
the spider. The mapping to the random walk allows us 
to use 43} with M = 2N + 4 and x = 2N + 1 to yield 
3(2A + l)/2 for the average time. 

The third term on the right-hand side of Eq. (f3"2"|) ac- 
counts for the left leg of the spider hopping from site 
N + 1 to N + 2. Starting from this position, the mean 
duration time (tjv) before a new site is visited is found 
by writing an equation similar to Eq. 



(t N ) = [1 - (1 + r)dt}(? N ) + dt (t n ) + dt 



Simplifying Eqs. 



we obtain 



w \ 3(2JV+1) 
(1 + t)(t n ) = r -i — '- + [T N ) 



(l + r)(? N ) = 
from which we find 



2 

(tn) + 1 



(tn) = - + 



1 t 3(2N + 1) 1 + r 
r 



(33) 

(34a) 
(34b) 

(35) 



Plugging (|35[) into the sum ([7]) we obtain 

1 + r N 2 N 



(T N ) = 3 • 



r 2 



(36) 



If there is no difference between the substrate and the 
product and the hopping rates are equal to r everywhere, 
the average time is equal to 



(Tjv)uniform = N(N + 1) 



(37) 



Hence memory is important for spiders as it affects the 
leading asymptotic behavior of (Tn)- 



B. Effective bias 

We now argue that a slowdown at new sites leads to 
an effective bias for bipedal spiders. Imagine that the 
spider is in configuration (|3"Tj) . Analyzing various routes, 
we obtain that the right leg first hops to the left with 
probability 



1 



1 



1 



1 + r 1 



(38) 



and otherwise it hops to right with probability p = 1 — q. 
An explanation of ([38| is as follows. The first term on 
the right-hand side describes the probability that the first 
leg which is going to hop will be the right leg — this 
happens with probability and the leg ought to hop 
to the left. If the first leg to hop is the left leg — this 
happens with probability — it then has to hop back 

(the probability is again j-p^) an d then we are back to 
the initial configuration thereby providing us with factor 
q. Solving (f3"B"]) we find 



1 



q = 



2 + r ' 



P : 



(39) 



The probabilities p and q are somewhat misleading, 
e.g. if the right leg first hops to the left, the spider con- 
figurations are different — in the starting one • o • the 
legs are separated, while in the final configuration • • o 
the legs are in adjacent sites. It is easier to appreciate the 
probabilities p+ and p- that the spider first moves for- 
ward and backward, respectively, while the configuration 
remains the same. 

Let us rephrase this definition. First note that knowing 
the position x of the center of mass fully determines the 
position of both legs of the bipedal spider with s = 2. 
We assign x — to the initial configuration (|3"Tj) . The 
jump of any leg changes the position of the center of 
mass by ±1/2, hence x takes half-integer values. We are 
interested in the probability p + that the center of mass 
moves forward by one lattice site to x = 1 first, and in 
the probability p_ = 1 — p + that it moves backward to 
x = —1 first. 

We already know the probabilities of the first jump of 
the right leg (|3"5|) . With probability p the right leg jumps 



6 



to the right first, and the spider is already at x = 1. On 
the other hand, the right leg jumps to the left first with 
probability q, which puts the spider at x = —1/2. The 
spider then performs a simple random walk on the half 
integers, and reaches x — 1 first with probability 1/4. 
Collecting these terms we arrive at 



P+ = P 



l-r 
4(2+7 



(40) 



and p- = 1 — p+ . Thus the spider moves symmetrically 
when r = 1 (no memory), and as r decreases, the bias for- 
ward increases monotonically. The bias is the strongest 
in the r — > limit when p_ =3/8 and p + = 5/8. 




C. Growth of the average number of visited sites 

We now apply the above results to the bipedal spider. 
We approximate our spider by an effective walker that 
steps from already visited sites at rate one to both di- 
rections. The step length of this effective particle is 1/2. 
Therefore it visits only half as many sites of the origi- 
nal lattice as a particle whose step length is one. Thus 
(N) = A 2 {r) \ft with 



A 2 (r) = 



A(r) 



(41) 



Without memory we have A 2 (r = 1) = 2/y / 7r. 

From a new site, the effective particle hops predom- 
inantly in the forward direction. We approximate the 
ratio of the forward and backward rates of the effective 
particle by the ratio of the forward and backward prob- 
abilities, f/b = p + /p-. Using (|4H|) we conclude that 
parameter a, see (|T8l) . becomes 



3(1 + 
2(2 + r) 



(42) 



Using (f2"9")l and (|41[) - (|4"2"|) we arrive at the prediction for 
the average number of visited sites at time t: 



(N)=A 2 (r)Vt, A 2 (r) = 



3 + 3r 

4 + 2r 



5 + 4r 
4 + 2r 



(43) 



The amplitude A 2 (r) depends only weakly on r in the 
< r < 1 region. As r increases from to 1, A 2 (r) 
decreases monotonically from 



A 2 (r = 0) 



£(3/4) 
r(5/4) 



1.3519 



to A 2 (r = 1) = 2/y/ir w 1.1283. Fig.[2]shows an excellent 
(perhaps exact) agreement between the above theory and 
simulation results. 



FIG. 2: The amplitude A in the growth law (TV) = Ay/i as a 
function of r. For the bipedal L — 2 spider with maximal leg 
distance s — 2, simulations are in excellent agreement with 
the theoretical prediction <|43f) . The same is also valid for the 
L = 3 spider with S = 3, where the theoretical prediction is 
given by (ggj. 




FIG. 3: Simulation results for the diffusion coefficient as a 
function of rate r for the bipedal L = 2 spider with maximal 
leg distance S = 2, and for the L = 3 spider with S = 3. The 
dashed lines are the approximation (|44|l . 



D. Diffusion coefficient 

Up to now we have probed the spider's motion through 
the trace it leaves — the number of visited sites. This is 
the quantity which is measured in experiments In- 
terestingly, a direct analysis of the spider motion is much 
more challenging both experimentally and theoretically. 

One can of course proceed numerically. According to 
simulations, the position of the spider (averaged over 
many runs) follows a Gaussian distribution in the large 
time limit. This is somewhat counterintuitive since the 
spider slows down at the extreme positions. Overall this 
is a greatly simplifying feature since if the position is 
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(asymptotically) Gaussian, we can characterize the spi- 
der's motion solely by its diffusion coefficient D. 

Since the Gaussian distribution gives the position of 
the spider in the large time limit, one might try to ap- 
proximate its diffusion coefficient through the relation 
(N) = Ay/Dt/w which is valid for simple random walks. 
This leads to 



D 



nA 2 



(44) 



For r = 1 (no memory), the relation D = ttA 2 /16 is 
exact. For r < 1, however, it underestimates the spider's 
diffusion coefficient, see Fig. [3] 



The probability that the rightmost leg first jumps to the 
right is p — 1 — q. Following the same reasoning as in the 
L = 2 case, we finally arrive at 



1 (£-l)(l-r) 

2 2L[2 + r(L-l)] 



(47) 



and p_ = 1 — p+ . As r increases from to oo , the bias to 
the right p + decreases monotonically, and the motion is 
unbiased p + — 1/2 for the neutral r = 1 case. Hence, a 
slowdown at new sites (r < 1) leads to the outward bias 
for arbitrary L. ft is also clear from (|4T|) that the bias is 
weaker for larger spiders. 

To calculate the number of visited sites we first obtain 



IV. MULTI-PEDAL SPIDERS 

Spiders with an arbitrary number of legs L are diffi- 
cult to analyze. Here we consider global spiders which 
(by definition) satisfy the following constraint — the dis- 
tance between any two legs is < S. Since the site cannot 
accommodate more than one leg S > L — 1, and the spi- 
ders with S = L — 1 are immobile, so the first interesting 
case corresponds to S = L. These global spiders include 
the bipedal spiders studied in Sec. lIIII We now show that 
these global spiders can be analyzed generalizing the ap- 
proach of Sec. IIII1 

First we notice that knowing the center of mass posi- 
tion of a spider with S = L we can read off the position 
of all its legs. Hence as for the L = 2 case we can re- 
place the spider by an effective particle whose position 
coincides with the center of mass of the spider. The step 
length of this effective particle is 1/L. Consider a config- 
uration 

...6666oooo»»»oi6666... 

where the right leg has just stepped to a new site. We are 
interested in the first passage probability p + for the effec- 
tive particle to move one whole lattice spacing (L steps) 
to the right before moving one lattice spacing to the left. 
Denote again by q the probability that the rightmost leg 
will first jump to the left. Repeating an argument that 
led us to (1551). we arrive at 



Q = 



1 l + (L-2)r ,, c . 
£ 9> 6 =^77 7TZ (45) 



l + (Z-l)r 



Here e is the probability that after the right jump of 
the second leg from the right, the spider returns to the 
initial configuration without ever moving the rightmost 
leg. This simple first passage problem can be solved, see 
e.g. 0, to obtain ([35]). Form ijlSjl. the probability that 
the rightmost leg first jumps to the left is 



1 + (L-l)r 

2 + (£- l)r 



L + l 1 + (L- l)r 
+■ (L- l)r 



L 



(48) 



from l|47p. Recalling (|29|) and taking into account the 
normalization Al = A/ L we find the amplitude 



Al{t) 



r(a) 



L T(a + 1/2) 



(49) 



(46) 



in the growth law (N) = Ai,(r)^/i. An agreement be- 
tween theory and simulations is excellent, sec Fig. [2] 

Equation (|48[) shows that a = a(r, L) is an increasing 
function of both r and L. Since T(a)/T(a + 1/2) is a 
decreasing function of a, we conclude that for L = S 
global spiders the number of visited sites decreases as r 
or L grows. 

We believe that this is generally true for arbitrary spi- 
ders, i.e. A decreases as r or L grows. Intuitively it is 
easy to understand the monotonic L dependence, as an 
additional leg only blocks the motion of the others. The 
r dependence of A is more intriguing, since slowing down 
the spider actually results in more visited sites. One 
could argue that the slowdown at the boundary keeps the 
spider around the unvisited sites, hence it visits more of 
them. This argument, however, is flawed since the slow- 
down does not effect the behavior of a random walk as 
we saw it in Sec. [TTJ 

Here is an argument that supports the above conjec- 
ture. When the rightmost leg steps to a new site it has to 
stretch out a bit. If legs slow down on new sites, it gives 
other legs time to catch up, which leads to the outward 
bias. Another argument is that the when the rightmost 
leg stays longer at the new site, the second leg has more 
chance to step next to it, hence forcing the rightmost leg 
to the right. On the other hand, when the rightmost leg 
steps to a new site, the spider is a bit stretched out and 
possibly cannot step further due to some constraint (e.g. 
local or global). When the rightmost leg stays in place 
longer, it gives more time to the other legs to step closer, 
allowing the rightmost leg to step to the right, hence it 
leads again to a bias to the right. 
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V. CONCLUSION 

In experimental realizations spiders affect the en- 
vironment which in turn affects their motion. More pre- 
cisely, the state of any site on a surface changes irre- 
versibly after the first visit of a leg of a spider and then 
it remains the same. During the subsequent visits legs 
are bound less firmly to previously visited sites, and we 
model this feature by prescribing different hopping rates 
from sites visited in the past and sites visited for the first 
time. Namely a leg slows down on new sites while the 
spider remains agile inside a previously visited region. 
Thus the knowledge of the spider's entire trajectory is 
required to describe its motion. 

First, we considered a random walk (a one-leg spider) 
that slows down on the new sites. We calculated first 
two moments, (Tjv) and (1%), of the time Tjv to cover N 
sites, and also the dual quantity (N), that is the average 
number of sites visited until time T. All these quanti- 
ties exhibit the same leading asymptotic behavior as in 
the case of a simple random walk without slowing down. 
These concrete results clearly reveal the general truth, 
namely that the non-Markovian nature of the problem af- 
fects only sub-leading behaviors of the random walk with 
slowing down. All exact calculations support this conjec- 
ture, and we think that it is valid even in two dimensions. 
Heuristically, the above hypothesis relies on the crucial 
feature of a random walk in one and two dimensions — 
the random walk keeps visiting the same sites again and 
again (the average number of visits diverges with time), 
and therefore the slowing down on new sites becomes 
asymptotically less and less important. 

The above argument seemingly applies to spiders as 
well, so it was surprising that the computation of the av- 
erage time (Tjv) to cover N sites for the simplest bipedal 
spider with s = 2 gave a result that differs from the corre- 
sponding result for the same bipedal spider without slow- 
down. Thus, the slowdown affects the behavior even in 
the leading order. The especially striking feature is that 
the spider is effectively more agile, that is, the number of 
covered sites grows faster than for the same spider with- 
out slowdown. The mechanism leading to this accelera- 
tion is an effective bias towards unvisited sites. This bias 
is not apparent since legs hop symmetrically. Yet once 
the front leg steps onto a new site, it slows down and the 
hind leg hops to the neighboring site thereby effectively 
pushing (due to exclusion) the front leg further into the 
unvisited region. This argument does not fully explain 
the bias as the hopping processes are stochastic, but it 
does make the bias plausible. For the bipedal sider with 
s = 2, and more generally for the class of multi-pedal 
spiders with the maximal span equal to the number of 
legs (S = L), we computed the bias analytically. We be- 
lieve that the bias exists for all spiders; numerically we 
confirmed this belief in all cases we looked at, but this is 
still a conjecture which may be very difficult to prove. 

The bias explains the qualitative behavior of the av- 
erage time (Tjv), and it suggests that the dual quantity 



(N), that is, the average number of sites visited by time 
t, also grows faster than for spiders without slowdown 
on the new sites. This quantity is important as it is di- 
rectly measured in experiments |5j|. We computed (N) 
by replacing the spider by an excited walk, that is a walk 
that hops symmetrically from previously visited sites but 
biased towards the unvisited region when hopping from 
new sites. For this excited walk we calculated not only 
the average, but the whole probability distribution of the 
number of visited sites. The replacement of the spider by 
an excited walk is, however, not rigorous, but presumably 
exact due to the perfect agreement with simulations of 
the spiders. 

Apart from (Tv) and (N), we tried to compute the 
diffusion coefficient. Numerically we observed that the 
probability distribution of the spider position approaches 
a Gaussian distribution in the long time limit, and there- 
fore the diffusion coefficient fully characterizes the prob- 
ability distribution of the spider position. We neither 
proved that the probability distribution is (asymptoti- 
cally) Gaussian nor computed the diffusion coefficient. 
This is an interesting challenge for future work. 

Finally we note that models of synthetic molecular spi- 
ders are closely related to models of natural molecular 
motors [H [H H [H, [H [H HI. Memory effects can 
occur when motors change their environment, which then 
affects the motors motion. (This happens e.g. for the col- 
lagenase which moves along collagen fibrils [11].) The 
non-Markovian nature of these situations makes these 
problems challenging, nevertheless, some problems have 
been analyzed [12|, 123 ■ It should be possible to employ 
the methods developed here in the context of molecular 
spiders, and to imply them to the behavior of molecular 
motors in situations when memory effects are important. 
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APPENDIX A: ESCAPE OF A SYMMETRIC 
RANDOM WALK FROM AN INTERVAL 

We have a random walk on the integers which hops at 
rate one to the left, and at rate one to the right. The walk 
starts at time t = from site m, where 1 < m < k — 1, 
and site and k are absorbing. We are interested in the 
first passage probability distribution to reach either site 
or k for the first time. 

The position of the walk on the infinite lattice at time 
t is given by 0] 

q(n,t\m) = e- 2t I ln ^ nl (2t) (Al) 
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We then do a mirror trick. We start infinitely many 
walkers at time t = from positions n — 2ak + m, and 
'negative' walkers from n = 2ak — m, where a runs over 
all integers. Now their superposition gives the probability 
distribution of the walker in the interval 

oo 

p(n,t\m)= V" [q(n,t\2ak+m)-q(n,t\2ak-m)] (A2) 

a— — oo 

since, from symmetry reasons, p(0, t) = p(k, t) = for all 
time. 

The Laplace transform of the position distribution 
q(n, t\m) is 

/ \ \n— m\ 

(s + 2 - Js(s + 4) r \n-m\ 

Q(n, s\m) = ± : : ; ' = — — (A3) 

where we have used shorthand notation 



c=1+ W3Z±i) ) d=v ^T4) (A4) 



Hence we have to sum up a geometric series to obtain the 
Laplace transform of (|A2[) 



P(n, s\m) 



„\n— m| _ c n+m _ c — m\( c n _ c —n\ 



d d(l-c- 2k ) 
The first passage probability to site and site k is simply 

(c- c- 1 )(c m - k - c k - m ) 



L(s\m) = P(l,s|ra) 



R(s\m) = P(k - l,s|ra) = 



d(c k - cr") 
(c- c" 1 )^"™ - c m ) 
d(c k — c~ k ) 



respectively, and the probability to leave the interval at 
either end is <7fe(s|m) = L(s\m) + R(s\m). 

We are interested in the s — > 0, k — > oo limit, with 
k^fs and m being constants. Noting that in this limit 

c m -> l-myG + 0(s) 

c -™_ c m 2m^ + 0{ S Z/2 ) 

d -» 2V^+0(s 3/2 ) 
c k -> e- k ^ + 0(s) 

we establish the expansions 

L(s\m) — > 1 — m\fs cotanh k yi + O(s) 

(A6) 

i?(s|m) — > m vi cosech /c vs + O(s) 
Therefore the exit at either side of the interval becomes 

gk(s\m) -> 1 - m-v/s tanh— ^- + O(s) (A7) 

This asymptotic has been used in subsection III Bl 

Finally we notice that the diffusion approximation [l4j | 
of this simple random walk 

~. . sinh(/c — m)y/s ~. sinh mJ~s 

L[s\m) — - ; — j= , R(s\m) 



sinh k^/s 



sinh kyG 



leads to the exact asymptotic behavior (|A6|) in the lead- 
ing order. This property has been exploited in 0]. 
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